Using J2EE/NET Clusters for Parallel Computations of Join Queries in Distributed Databases
نویسندگان
چکیده
In here we consider the problem of parallel execution of the Join operation by J2EE/.NET clusters. These clusters are basically intended for coarse-grain distributed processing of multiple queries/business transactions over the Web. Thus, the possibility of using J2EE/.NET clusters for fine-grain parallel computations (parallel Joins in our case) is intriguing and of practical interest. We have developed a new variant of the SFR algorithm for parallel Join operations and proved its optimality in terms of communication/execution-time tradeoffs via a simple lower bound. Two variants of SFR algorithm were implemented over J2EE and .NET platforms. The experimental results show that despite the fact that J2EE/.NET are considered to be platforms that use complex interfaces and software entities, J2EE/ .NET clusters can be efficiently used to execute the Join operation in parallel.
منابع مشابه
The .NET implementation of a Parallel Query Engine
Parallel database systems implemented using Web technologies, such as J2EE or .NET are one of the most complicated and intriguing distributed systems. These systems combine parallel evaluation of a single query with the concurrent evaluation of multiple queries made by a dynamic set of remote clients. Typically, these systems are implemented using clusters of machines hosting three types of ser...
متن کاملComplex Query JOIN Optimization in Parallel Distributed Environment
The research work covers the query optimization concept in parallel distributed environment. The queries considered are select-project-join (SPJ) queries with large databases. The main query operation considered for research is JOIN operation of the query. For fast execution of a complex query, JOIN operation time needs to be minimized. Different JOIN operation algorithms such as Network Byte O...
متن کاملRelational Databases Query Optimization using Hybrid Evolutionary Algorithm
Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...
متن کاملEfficient Multi-way Theta-Join Processing Using MapReduce
Multi-way Theta-join queries are powerful in describing complex relations and therefore widely employed in real practices. However, existing solutions from traditional distributed and parallel databases for multi-way Theta-join queries cannot be easily extended to fit a shared-nothing distributed computing paradigm, which is proven to be able to support OLAP applications over immense data volum...
متن کاملScalable Distributed Data Structures for High-Performance Databases
Present databases, whether on centralized or parallel DBMSs, do not deal well with scalability. We present an architecture for Wintel multicomputers termed AMOS-SDDS, coupling a high-performance main-memory DBMS AMOS-II and a manager of Scalable Distributed Data Structures SDDS-2000. SDDS-2000 provides the scalable data partitioning in distributed RAM, supporting parallel scans with function sh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JDIM
دوره 3 شماره
صفحات -
تاریخ انتشار 2005